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^ 1. Abstract 

^ Drug resistance remains a major problem for the treatment of HIV. Resistance can occur 

due to mutations that were present before treatment starts or due to mutations that occur 
during treatment. The relative importance of these two sources is unknown. We study three 
^ different situations in which HIV drug resistance may evolve: starting triple-drug therapy, 

treatment with a single dose of nevirapine and interruption of treatment. For each of these 
three cases good data are available from literature, which allows us to estimate the probability 
that resistance evolves from standing genetic variation. Depending on the treatment we find 
probabilities of the evolution of drug resistance due to standing genetic variation between 
and 39%. For patients who start triple-drug combination therapy, we find that drug 
resistance evolves from standing genetic variation in approximately 6% of the patients. We 
use a population-dynamic and population-genetic model to understand the observations and 
to estimate important evolutionary parameters. We find that both, the effective population 
size of the virus before treatment, and the fitness of the resistant mutant during treatment, 
are key-parameters that determine the probability that resistance evolves from standing 
genetic variation. Importantly, clinical data indicate that both of these parameters can be 
manipulated by the kind of treatment that is used. 
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2. Introduction 

For most HIV patients, treatment with modern antiretroviral therapy leads to a rapid 
decUne of viral load (VL) of several orders of magnitude. However, when the virus acquires 
resistance to one or more drugs, treatment can fail. It is still an open question whether the 
mutations responsible for resistance originate usually from standing genetic variation (also 
referred to as pre-existing mutations or minority variants), or from new mutations which 
occur during therapy. In fact, there is no biological system for which the relative role of pre- 
existing and new mutations is well known (Barret 2008). The case of HIV drug resistance 
may serve as a good case-study for evolutionary biologists. In addition, understanding the 
role of pre-existing drug resistance mutations may help guide the design of better treatment 
strategies to reduce the risk that resistance evolves in patients. 

In this paper we will look at the establishment of drug resistance mutations in three 
different situations: (1) when triple-drug therapy (ART) is started for the first time, (2) 
when pregnant women are treated with a single dose of nevirapine to prevent infection of 
the baby during birth and (3) when standard therapy is interrupted and restarted. We will 
argue that standing genetic variation plays a crucial role in each of these cases. We find that 
the probability that resistance mutations become established in each of these cases can be 
understood by using a simple population genetic model. 

For readers who are not familiar with HIV, it is important to know that the genotype- 
phenotype map for drug resistance in HIV is very well known. Lists of the important re- 
sistance mutations for each drug are published (e.g. in the International AIDS SocictyUSA 
drug resistance mutations list, Johnson 2010), so that doctors can compare the genotype of 
the virus of a patient before treatment with this list to decide which drugs to prescribe. The 
aim of treatment is to achieve viral suppression. If treatment fails despite adherence to the 
regimen, a second genotypic test will be performed to see whether the virus has acquired 
new resistance mutations. Since the second half of the 1990s, treatment is usually with a 
combination of three drugs, which are chosen such that mutations which confer resistance 
against one of the drugs do not confer cross-resistance against the other two drugs. Soon it 
became clear that triple-drug therapy was an enormous success and saved the lives of many 
HIV patients (Mocroft 1998). One reason why therapy with three drugs works better than 
treatment with one or two drugs is that the rate at which resistance evolves is slower when 
patients are treated with three drugs (Deeks 1998). It is commonly thought that resistance 
does not evolve in patients on triple-drug therapy because it would require a viral particle 
to acquire three mutations at the same time. However, in patients who are treated with 
triple-drug therapy, it is often observed that resistance against one of the drugs evolves, at 
least initially. Data from several cohort studies in different parts of the world, such as from 
Canada (Harrigan et al 2005) and the UK (UK CHIC cohort study, Cozzi-Lepri 2005) clearly 
show that in most patients who fail therapy due to resistance, the virus is resistant against 
one of the drugs and almost never against all three. The UK study, for example, reports that 
out of 4306 patients who started therapy between 1996 and 2003, after two years of therapy, 
13% have drug resistance, but less than half of these patients (6%) have resistance against 
more than one class of drugs and a only small number of patients (1%) have resistance against 
3 classes of drugs, even though all patients of this cohort were treated with three classes of 
drugs. These data show that treatment can fail due to resistance against one of the drugs 
in a regimen. The viruses that have acquired resistance against two or three classes of drugs 
may have acquired several drug resistance mutations at the same time or they may have 
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acquired resistance against one drug first and subsequently acquired resistance against the 
other classes. In order to distinguish between these two possibilities, a very dense sampling of 
a patient's viral population would be needed. For now, we will assume that in most patients 
the accumulation of resistance mutations is a gradual process. 

For many common drugs, especially reverse transcriptase inhibitors, a single mutation 
can confer resistance against the drug and only a small number of mutations is respon- 
sible for resistance in most patients. For example, resistance against the drug nevirapine 
is almost always due to one of two amino acid changes, namely K103N or Y181C in the 
reverse-transcriptase gene (Paredes et al 2010). Because of the importance of a small num- 
ber of mutations, several studies have investigated whether these mutations are present in 
untreated patients due to transmitted drug resistance or due to spontaneous mutation. Re- 
cent studies have used allele-specific PGR and related methods to determine the frequency of 
several important mutations in untreated patients. Low-frequency drug resistance mutations 
(DRMs), likely due to spontaneous mutation (and not transmitted from other patients) were 
detected in up to 40% of patients (see Gianella and Richman 2010 for an overview). The 
detection of drug resistance mutations in untreated patients, together with the knowledge 
that a single mutation can confer resistance against a drug and allow viral escape, suggest 
that pre-existing resistance mutations (or standing genetic variation in the population genetic 
jargon) may play an important role in the evolution of drug resistance in HIV. 

Throughout the paper, we will assume that a single mutation can allow viral escape and we 
focus on the probability that such a first drug resistance mutation becomes established (i.e., 
it reaches such a frequency that it can be expected to become the majority variant unless 
treatment is stopped or changed quickly). What happens after a first mutation has become 
established, or how fast such an established mutation wanes in the absence of treatment are 
important questions, but they fall outside the scope of this study. In this paper, 'triple-drug 
therapy" refers to treatment with two drugs of the class NRTI plus either an NNRTI or an 
unboosted PI (for a list of abbreviations in the paper, see Table 2). The results are hkely to 
be different for other drug combinations. 

Starting therapy. When a patient starts therapy for the first time, one would expect that 
there should be a substantial probability that drug resistance evolves due to pre-existing 
DRMs. Indeed, recent studies have shown that the presence of drug resistance mutations 
at low frequency {<!%) increases the risk that treatment fails (e.g., Johnson 2008, Geretti 
2009, Paredes 2010, see Li et al 2011 for a review). However, the situation is not as simple 
as one may hope: even if no pre-existing DRMs can be detected, resistance mutations may 
become established quickly, and even if DRMs are detected, treatment is still successful in 
the majority of patients. We will attempt to understand those observations using population 
genetic theory. Other authors have looked at the question of pre-existing DRMs previously 
(e.g., Bonhoeffer and Nowak 1998, Ribeiro and Bonhoeffer 2000), however, it is worth re- 
considering the topic. First of all, we now have a wealth of data available for pre-existing 
DRMs and the establishment of drug resistance mutations in HIV patients, and secondly, we 
now have a better theoretical framework to consider the role of standing genetic variation for 
adaptation (Hermisson and Pennings, 2005). 

Prevention of mother to child transmission (PMTGT). Pregnant women in low resource 
settings are often treated with a single dose of the non-nucleoside reverse transcriptase in- 
hibitor neverapine when labor starts. Single dose nevirapine (sdNVP) is the cheapest and 
simplest way to reduce the probability of mother-to-child-transmission, but it is shown to 
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lead to the establishment of drug resistance mutations in the mothers and the babies. In a 
meta-analysis, Arrive et al (2007) found that, in 7 different studies, on average 44% of the 
patients treated with sdNVP had detectable NVP resistance mutation several weeks after the 
treatment. The presence of such mutations makes future treatment of these women harder 
(Lockman et al 2007). To avoid the establishment of resistance mutations, several alterna- 
tive strategies are used in combination with sdNVP. We will use the same population genetic 
framework as in the other two cases to try to understand why sdNVP leads to establishment 
of resistance mutations in so many patients, and how this can be avoided. In the current 
study we will only focus on the probability that NVP resistance mutations become estab- 
lished during treatment for PMTCT. The issue of how these mutations wane and possibly 
resurface when treatment is started again is important and interesting but falls outside the 
scope of the current paper. 

Treatment interruptions. It was long suspected that treatment interruptions lead to drug 
resistance. Indeed, cohort studies show that treatment interruptions due to non-adherence 
are associated with faster accumulation of drug resistance mutations (Tam et al 2008, Gard- 
ner 2010, Lima 2010). Clear evidence that treatment interruptions of at least a couple of 
weeks lead to the establishment of resistance mutations comes from clinical trials (e.g., Yerly 
2003, Danel 2006, 2009) which were done in a time when it was believed that treatment 
interruptions may be beneficial for patients. In 2006 the SMART trial was stopped because 
treatment interruptions were shown to have a negative effect on patients' health (El-Sadr 
2008). However, treatment interruptions still occur, for example, when a patient is forget- 
ful or is unable to purchase drugs due to financial or logistic barriers. It is important to 
understand how treatment interruptions lead to resistance and whether this effect can be 
avoided. 

The main idea that currently governs the thinking about treatment interruptions and re- 
sistance is that insufficient drug-levels allow for replication and, at the same time, select for 
resistance (e.g., Taylor 2007, Fox 2008, Gardner 2010). This effect is aggravated when drugs 
that are part of combination therapy have very different half-lifes, so that interrupting com- 
bination therapy can result in effective monotherapy. It is generally believed that this "tail 
of monotherapy" is the main reason why treatment interruptions lead to drug resistance. 
However, several observations are not compatible with the "tail" hypothesis. For example. 
Fox et al (2008) found no significant difference in the number of resistance mutations after si- 
multaneous, "staggered" or "switched" treatment interruptions in patients from the SMART 
trial (a "staggered" stop means that the long half-life drug is interrupted several days before 
the other drugs and a "switched" stop means that before interrupting, patients switch to a 
regimen with only short half-life drugs). In addition, the "tail" hypothesis fails to explain 
why treatment interruptions increase the risk of resistance in patients on protease inhibitor- 
based (PI) regimens which do not have long half-lifes (Dybul 2003, Yerly 2003, Arnedo- Valero 
2005, Henry 2006, Ruiz 2007, Darwich 2008). Another explanation is needed to understand 
the observed patterns. 

When treatment is interrupted, the viral load rapidly increases until it has reached its 
original level after approximately four weeks (Davey et al 1999). Basic population genetics 
tells us that such population growth also leads to an increase in the probability that DRMs 
are present. When treatment is started again, selection may work on such pre-existing 
mutations, which provides a simple explanation for how treatment interruptions lead to the 
establishment of resistance mutations. 
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In this paper we will attempt to explain the observed patterns by considering selection 
on pre-existing variation and selection on new mutations. Throughout the paper, we use 
a mathematical model for adaptation from standing genetic variation which we developed 
previously (Hermisson and Pennings, 2005) and forward-in-time, individual-based computer 
simulations. The model captures mutation, drift and selection, including changing selection 
pressures (due to stopping and starting of therapy) which lead to changes in population size. 
Because we only focus on the establishment of the first drug resistance mutation, we can 
ignore epistatic interactions between different drug-resistance mutations and recombination. 
In each of the three cases of interest, we use published data on the percentage of patients with 
established drug resistance mutations to estimate important parameter values (for starting 
ART or sdNVP) and to predict outcomes (for treatment interruptions). 

3. Model, assumptions and fixation probability of a drug resistance 

mutation 

The model we use in this paper describes the population dynamics and population genetics 
of a panmictic viral population in a single patient. Details of the model can be found in the 
supplementary material. We assume that as long as the patient is not on anti-retroviral 
therapy, the viral population will be stable at population size Nu (u for untreated). Drugs 
reduce the fitness of the wildtype virus to below 1 so that the population will shrink. We 
assume that there is a large reservoir of latently infected cells of which a fixed number (/) 
become activated per generation, so that the virus can not die out. Drug resistant virus 
can be created by mutation and is assumed to be resistant against one of the drugs in the 
treatment regimen. If the patient is not taking drugs, the driig resistant virus is less fit than 
the wildtype by a factor Crei (relative cost of the resistant virus), but if the patient is taking 
drugs, the resistant virus has a fitness that is higher than 1 {FmART > 1), whereas the 
wildtype has a fitness lower than 1 {FwtART < !)• In reality, there may also be resistance 
mutations that confer resistance against one of the drugs, but that do not lead to a fitness 
higher than 1. Such mutations will quickly die out and can safely be ignored in the model. 
Throughout the paper we focus on the the processes that allow a first major drug resistance 
mutation to become established in the patient. Patients are assumed to be ART-naive and 
have no transmitted drug resistance. 

Evolutionary biologists have long known that most mutations will be lost by genetic drift 
even if they confer a fitness benefit (Haldane 1927). This is also true for drug resistance 
mutations (DRMs) in patients on anti-retroviral therapy, although it is all too often ignored 
in drug resistance studies. The clinical relevance of this old result has recently become very 
clear. It was found in several studies that even though low frequency DRMs increase the 
risk of treatment failure and establishment of drug resistance, the majority of patients with 
detected low frequency DRMs will respond well to treatment (Paredes et al 2010). This 
result shows that DRMs can die out, even if they have reached frequencies high enough to 
be detected. The reason is probably that most viral particles will not infect any new cells 
and produce no new viral particles, even if, on average, they produce more than 1. Because 
of stochastic effects, not every particle will have the same number of offspring and if some 
have many, then others must have 0. 

The probability that a DRM becomes established in the patient depends on the number 
of copies that are present, the average number of offspring of the drug resistant particles 
and the variance in offspring number. Traditionally, fixation or establishment probabilities 
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are calculated using the relative fitness difference between the mutant and the wildtype, 
but in the case of HIV it is more useful to use the fitness of the mutant virus to calculate 
its establishment probability. The reason is that anti-retroviral therapy works so well that 
wildtype fitness may be very low (much lower than 1). In such case fitness of the mutant 
may not be related to the fitness of the wildtype and because the wildtype cannot grow, the 
two types do not compete for resources. In other words, the mutant can occupy a niche that 
is not occupied by the wildtype. In those cases the establishment probability of the mutant 
will be approximately P^st ~ ^*^^'^~^^ where a is the variance in offspring number. In the 
simulations and throughout this paper, we use the variance effective population size, in which 
case one can assume that cr = 1, so that 

(1) Pest ^2{Fm- 1) 

It is important to realize that if the establishment probability of a DRM depends on its 
absolute fitness, anything that reduces its fitness will reduce the establishment probability. 
For example, if a drug that is added to a regime reduces fitness of both wildtype and resis- 
tant virus, then it will reduce the probability that a pre-existing resistant mutant becomes 
established. This is true even if the effect of the added drug on wildtype and resistant virus 
is exactly the same. Similarly, if the immune system works well, this may also reduce the 
probability of establishment. 

In most population genetics models, the focus is on the fixation probability, rather than 
the establishment probability of a mutation. And in many models, if a mutation becomes 
established, it will go to fixation. However, if selection pressures change, establishment does 
not necessarily lead to fixation. This is especially clear when we will later consider the effect 
of a single-dose of nevirapine. A few weeks after a single dose of nevirapine, nevirapine 
resistance mutations can be detected in a large proportion of patients, but these mutations 
may never take over the whole viral population, because the treatment duration is very short 
and wildtype virus will quickly become a majority again (see for example, Lockman 2007). 
In fact, the standard results on fixation probability (Haldane 1927) are really results on 
establishment probabilities, so we can use them without problems. 

Psgv vs. Pnew. For drug resistance to evolve, the viral population needs viral particles 
that carry drug resistance mutations. Such particles may already be present before treatment 
is started. To denote this possibility we use Pgg^ or the probability that drug resistance 
establishes from the standing genetic variation. If the mutation is not already present, 
or if was present but was subsequently lost, then the viral population has to wait for a 
new mutation to occur and become established. We denote this possibility as Pnew, or the 
probability that resistance evolves due to new mutations. In the latter case, we have to 
indicate a time window, such as per year or per generation. 

The goal of this study is to understand and, albeit roughly, quantify Pnew and Pggv for HIV 
drug resistance in patients on triple-drug regimes (consisting of an NNRTI or an unboosted 
PI plus two NRTI's) and in patients who are treated with single dose nevirapine. 

4. Starting standard therapy 

When a patient starts anti-retroviral therapy for the first time, the viral population in 
that patient will move from an equilibrium without drugs to an equilibrium with drugs. At 
the pre-treatment equilibrium, the viral population size will at its equilibrium level (N^), 
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and resistance mutations are expected to be at mutation-selection-drift equilibrium, where 
most mutations will be present at very low frequencies (see, e.g., Paredes 2010). Note that 
mutation-selection-drift equilibrium is reached quickly for mutations that are very costly to 
the virus. So even though it may take years for neutral diversity to reach an equilibrium level 
in an HIV patient (Kouyos et al 2011), important drug resistance mutations which are 5 or 
10% less fit than the wildtype are expected to reach their (dynamic) equilibrium in weeks or 
months. 

Standard population genetic theory predicts that the average frequency of a resistance 
mutation is equal to the mutation rate (//, per viral particle and per replication) divided 
by the relative cost {Crei) of the resistance mutation, though drift causes actual frequencies 
to vary greatly between different time points and between patients (see also Gadhamsetty 
et al 2010). Even though the average frequency is independent of the population size, in 
larger populations, it is more likely that DRMs are present and the absolute number of drug 
resistant particles will, naturally, be higher. When treatment starts, resistance mutations 
will confer a fitness benefit to the virus and they can (but are not guaranteed to) increase in 
frequency and become established. The probability that this happens depends on the number 
of resistant particles in the population and on the establishment probability of a mutation 
that is present in a single particle. In Hermisson and Pennings (2005) we derived formulas to 
calculate the probability that adaptation to a new environment happens from the standing 
genetic variation {Pggv)- We will use the approximate equation 8 in Hermisson and Pennings 
(2005): 

(2) Psg. - 1 - (1 + M^)-2Miv„ 

It is also possible to use the the number of resistant particles in a patient {B) and the 
fitness of these copies (in the environment with drugs) to calculate the probability that a 
resistance mutation becomes established: 

(3) Psgv = 1 - (1 - Pest)"" 

where we use the probability that all copies of the resistance mutation die out to calculate 
the probability that at least one survives. The probability that resistance mutations become 
established increases with the number of copies of resistant virus and the probability that 
any one of these survives. 

Evolution of resistance during therapy. If resistance did not evolve from standing 
genetic variation, it may evolve due to new mutations. The probability that this happens in 
a given year will depend on the number of generations (G) in a year, the mutation rate (/i), 
the effective population size during antiretroviral treatment (Nart) and the establishment 
probability of a mutation {Pest)- In principle, the establishment probability during therapy 
may not be the same as in the very beginning of therapy, for example because the number of 
available cells which a particle can infect could be different. However, throughout this paper 
we will assume that Pest depends only on the kind of therapy and not on how long a patient 
has been treated. Using a poisson approximation, we find that the pre year probability that 
resistance evolves is 



(4) 



Pnew = l-exp {-G Nart A* Pest) 
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It is debated whether during therapy, there is ongoing rephcation or whether a reservoir 
of latently infected cells is entirely responsible to residual viremia. If the reservoir reflects 
the composition of the viral population before treatment, then the expected frequency of the 
resistance mutation in the reservoir would be -J^. If the number of latently infected cells 
that become activated every generation is N^at, then the expected number of activated cells 
with resistant virus would be ^l^^. The per year probability that resistance evolves due to 
activated cells from the reservoir would be 



(5) 



exp 



rel 



It is also possible that there is ongoing replication, but that the reservoir also plays a role 
at the same time, so that the reality will be reflected best by a combination of equations |4] 
and m 



Probability of detecting resistance per year 











Psgv 









Year 2 
Year of treatment 



Figure 1. The probability that resistance is detected for the first time in the first, 
second or third year of treatment, given that it was not detected until then. The bars 
are the estimates from the Margot et al (2006) dataset. The red dashed area reflects 
the inferred probability that resistance mutations from standing genetic variation 
become established. The grey squares are values calculated using equations [2] and |4j 
The red circles are estimated from 1000 simulations. Parameters as in table [l] 

Comparison with data and parameter estimation Published data show that the rate 
of evolution of drug resistance is roughly constant over long times (see for example the study 
by Cozzi-Lepri et al (2010), in which patients were followed for up to eight years). This fits 
with expectations if N art and Pest remain constant so that Pnew stays constant. However, 
several studies show that the probability that resistance mutations become established is 
higher in the first year of therapy, as compared to later years. This can be seen, for example, 
in a study on a large cohort in British Columbia, especially when one considers the most 
adherent group of patients (figure 2 in Harrigan et al 2005, see also Tam et al 2008). A similar 
effect is seen in Li et al (2011) when one considers the patients with pre-existing DRMs. This 
effect, that resistance is more likely to evolve in the first year of therapy as compared to later 
years, can be easily explained by standing genetic variation. 
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Possible combinations of population size and fitness 




100 1000 10000 

Effective population size 



Prob. that DRMs are present before treatment 
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Figure 2. 2a. Continuous line: combinations of population size before treatment 
(Nu) and fitness of mutant virus during therapy {Fijiart that lead to the observed 
probability that resistance mutations from standing genetic variation become estab- 
lished {Pggv = 0.058). Dashed line: combinations of population size during treatment 
(Nart) and fitness of mutant virus during therapy {FmART that lead to the observed 
probability that resistance mutations from standing genetic variation become estab- 
lished {Psgv = 0.058). Open dot: Nu = 2000 and FmART = 1-017, closed dot: 
Nart = 108, Fijiart = 1-017. 2b. Probability that a patient has any pre-existing 
DRMs before the start of therapy for different population sizes, and = 5 * 10~5. 
Open dot: Nu = 2000 

Under the assumption that Pnew is indeed constant, we can use published data to estimate 
both Pnew and Pggv Margot et al (2006) reported the number of patients in which resistance 
was detected in the first, second and third year after treatment initiation in a cohort that was 
treated with NNRTI-based ART. The reported data show that the probability that resistance 
was detected in the first year was 9.5%, whereas in the second and third year it was only 
3.7% (see supplementary material for details on how this was estimated). The difference of 
5.8% is likely due to standing genetic variation at the start of therapy. 

We will use the estimates for Pnew (0.037 per year) and Pggv (0.058) from Margot et 
al (2006), in combination with other, pubfished, estimates to get a rough estimate of the 
important evolutionary parameters. First of all, we will assume that the mutation rate from 
one nucleotide to a specific other nucleotide is 10~^ (Mansky and Tcmin 1995), so that if 
there are five main resistance mutations for a given drug combination, the total mutation rate 
is approximately 5 10~^. For the remainder of the paper, we will only use this total mutation 
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rate. If the mutation rate would be higher (lower) than our assumption, the estimated 
population sizes would be lower (higher) than our estimates. 

We know that the important drug resistance mutations are at least somewhat costly for 
the virus. Their cost, Crei, has been estimated for several drug resistance mutations, both in 
vivo and in vitro (for an overview on resistance mutations in the reverse transcriptase gene 
see Martinez-Picado 2008). For example, Paredes et al (2009) find that the relative cost of 
resistance mutation M184V is approximately 0.04 — 0.08. Wang et al (2011) estimate a cost 
of 0.01 — 0.04 for K103N, which is the most common NNRTI resistance mutation. Other 
studies were not able to detect any cost of K103N, but given its low frequency in untreated 
patients (Paredes et al 2010), it seems likely that it is associated with a significant cost. In 
this paper we will use an average cost of 0.05 for all mutations. 

Given the cost, the mutation rate, Psgv and Pnew, and using the assumption that there are 
200 HIV generations in a year (Fu 2001), we can find the combinations of N^, Nart and 
FmABT that are compatible with the data (shown in figure |2]). Estimates for the effective 
population size in untreated patients range from 10^ (Leigh-Brown 1997) to 10^ (Rouzine and 
Coffin 1999). We know that a large proportion of untreated patients carries low frequency 
drug resistance mutations, but not all patients, which gives us some additional information 
about the population size in an untreated patient (see figure ^p) . If we choose a value of 
of 2 10^, then we find that about half of the patients should carry pre-existing DRMs. This is 
somewhat higher than what is usually detected, but that can be due in part to the limits of 
detection of current tests (Gianella and Richman 2010). An overview of parameter estimates 
which we used in the simulations and for analytical predictions can be found in table [Tj 

Given our choice of A^„, we find that Fitiart niust be approximately 1.017, leading to 
Pest ~ 0.034. Under the assumption Pest stays the same during treatment, the Margot 
et al data are compatible with a 18-fold reduction of population size due to therapy, to a 
population size of Nart ~ 108. Note however, that the estimate of a 18-fold reduction 
depends heavily on the assumption that Crei = 0.05. For example, if we had assumed a 10% 
cost, the estimated reduction would have been 37-fold, and for a 1% cost, the reduction would 
have been only 4-fold. The reason is that if we assume that costs are high, then we must also 
assume that the mutant fitness {Fitiart) is relatively high, in order to find Pggv = 0.06, and 
if FrriART is high, Nart niust be low, to explain Pnew = 0.037. 

If the evolution of resistance during therapy is not due to ongoing replication, but due 
to continuous activation of latent cells, then, under the assumption that Crei = 0.05, the 
number of cells (Nlat) must be approximately 5 per generation. This means a reduction of 
population size of almost 400-fold. However, it is not so clear whether in this case the word 
"population size" should still be used, because the number 5 is not an estimate of the size of 
the reservoir, but an estimate of the size of the part of the reservoir that is reactivated every 
generation. 

The result that the frequency of resistance mutations in the reservoir depends on their 
fitness cost {-q—^), whereas the cost does not play a role for new mutations due to ongoing 
replication, could be harnessed to estimate the relative importance of the reservoir. If the 
reservoir is the most important source of resistance mutations during therapy, then the same 
set of mutations should be found in patients whose virus acquires resistance quickly after the 
start of therapy and in those who acquire mutations during therapy. However, if ongoing 
replication is the source of resistance mutations during therapy, then mutations with a high 
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cost in the absence of drugs should occur relatively more often during therapy than quickly 
after therapy is started. 

The data and the results from simulations and predictions (using equations |2] and |4]) are 
shown in figure [Tj The percentage of patients with resistance after one year is lower in the 
simulations than in the analytical predictions, because it takes time for a mutation to increase 
in frequency and be detected. We assume that it is detected as soon as it is more frequent 
than the wildtype, the result is that in the simulations (and probably in reality) Pnew is lower 
in the first year than in the other two years. 

Table 1. Parameter values for analytical predictions and computer simulations. 



Parameter 


Value 1 Explanation 


Values roughly based on lit- 
erature 






5*10-5 


Mutation rate to resistant genotype 




2000 


Effective population size in untreated patient 


Crel 


0.05 


Relative cost of mutant in absence of therapy 


Fwt ART = Fwt MVP 


0.5 


Absolute fitness of wildtype during therapy 


G 


200 


Number of HIV generations per year 


Values estimated based on 
data 




Nart 


108 


Effective population size in patient on ART 


Nlat 


5 


Number of activated latent colls in patient on ART 


J^ZDV 


1000 


Effective population size in patient on ZDV monotherapy 


Fwtu 


1.62 


Absolute fitness of wildtype in absence of therapy (determines growth rate 
during treatment interruption) 


Frriu 


1.54 


Absolute fitness of resistant mutant in absence of therapy 


FmART 


1.017 


Absolute fitness of resistant mutant during ART 


FniMVP 


1.54 


Absolute fitness of resistant mutant during NVP therapy 


FrriMvp/PP 


1.025 


Absolute fitness of resistant mutant during NVP/PP therapy 


I 


135 


Number of particles from latent cells that enter the population per generation 



Table 2. Abbreviations. 



Abbreviation 


Explanation 


VL 


Viral load, the number of viral particles per ml blood 


ART 


Antiretroviral therapy, here used to mean treatment with two NRTIs and an NNRTI or an "un- 
boosted" PI 


PMTCT 


Prevention of mother to child transmission 


DRM 


drug resistance mutation 


NRTI 


Drug of class nucleoside reverse transcriptase inhibitor 


NNRTI 


Drug of class non-nucleoside reverse transcriptase inhibitor 


PI 


Drug of class protease inhibitor, Pis can be used "unboosted" or "boosted" with an additional drug. 


NVP 


Nevirapine, an NNRTI 


sdNVP 


Single dose nevirapine 


ZDV 


Zidovudine, also known as AZT, an NRTI 


3TC, DDI, FTC, TDF 


Drugs of NRTI class 



5. SiNGLE-DOSE NEVIRAPINE FOR PREVENTION OF MOTHER-TO-CHILD-TRANSMISSION 

A single dose of nevirapine (sdNVP) just before labor starts reduces the risk that a mother 
transmits HIV to her baby at birth, but leads to high levels of resistance in many women. 
Because of the long half life of nevirapine, even a single dose lasts at least a few days. 
However, this is a very short amount of time (only a few HIV generations) so that probably 
most or all detected NVP resistance mutations are due to standing genetic variation. 
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Because it is known that sdNVP can lead to the estabhshment of resistance mutations, 
and also to further reduce the risk that the baby becomes infected with HIV, several dif- 
ferent treatment strategies are being used. In this study, we focus only on those strategies 
that include a single dose of nevirapine (and exclude, for example, pregnancy limited triple- 
drug therapy). Basically, sdNVP can be combined with either a short course of zidovudine 
monotherapy during the third trimester of pregnancy (ZDV/sdNVP), or it can be combined 
with additional drugs during and after labor up to one month postpartum (sdNVP/PP). It 
can also be used alone (sdNVP) or combined with both (ZDV/sdNVP/PP), resulting in four 
possible strategies. 

Under the assumption that all resistance is due to standing genetic variation, it is straight- 
forward to predict, at least qualitatively, the effect of the four treatment options. Single dose 
nevirapine plus two additional drugs (sdNVP/PP) is a three drug regimen, and similar to 
standard antiretroviral therapy (ART), except that it only lasts a few days or weeks. We 
therefore expect similar levels of drug resistance due to standing genetic variation. If only 
NVP resistance is considered (and not resistance to the other two drugs), we expect to find 
somewhat lower levels than in the normal case, although the difference may not be large be- 
cause resistance against NVP is more common than resistance to most other drugs. Treating 
with only sdNVP is different from starting ART, in that there is only one drug. The result 
is that the fitness of both wildtype and resistant virus will not be reduced as much as in the 
normal case. Specifically, NVP resistant virus will have a relatively high fitness during NVP 
monotherapy. This high fitness {Fm^vp) leads to a high establishment probability {Pest) for 
available resistance mutations. In fact, the establishment probability may be so high that in 
virtually all patients that carry some NVP resistance before treatment, the resistant virus 
will increase in frequency during NVP treatment. 

An interesting treatment option is to start with a few weeks of ZDV monotherapy before 
treating with a single dose of nevirapine. The ZDV treatment will reduce the population 
size of the virus, Nu-, so that the probability that NVP resistance is available and the copy 
number of such resistant mutants if they are available will be lower by the time the patient is 
treated with NVP. ZDV monotherapy ultimately leads to ZDV resistance, but the risk that 
resistance mutations become established during a short course is small. ZDV monotherapy 
reduces the viral load approximately three- fold (BrunVezinet 1997). Finally, adding ZDV 
treatment before labor and two additional drugs during and after labor (ZDV/sdNVP/PP) 
will reduce both the availability of NVP resistant virus and the establishment probability of 
such virus, which should lead to an even lower probability that NVP resistance mutations 
from standing genetic variation become established. 

Comparison with data for single dose nevirapine. We identified 23 published studies 
that reported on NVP resistance 6 to 8 weeks after women were treated with sdNVP. Several 
of the studies directly compared two different treatment options. We found at least three 
studies for each of the four different treatment options. An overview of the studies can be 
found in table |4] in the supplementary material. For each study we recorded which of the 
four treatment options was used and in how many of the patients NVP resistance mutations 
were detected using simple Sanger (population) sequencing (we excluded studies that only 
recorded deep-sequencing or allele-specific PGR results, as there were too few of those to 
allow us to compare the treatment options). For each of the four treatment options, we also 
calculated the overall probability that resistance mutations were detected in a patient (simply 
by summing the number of patients with resistance and summing the total number of patients 
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Figure 3. The probability that resistance mutations are detected 6 to 8 weeks 
after treatment with single dose nevirapine. Black crosses are data from single 
studies (the number of studies is indicated in brackets at the top of the graph), 
black squares with estimated standard error are percentages for all studies combined 
(the number of patients that were used to calculate this percentage is indicated 
at the top of the graph). Red circles with standard error are results from 1000 
simulations and the grey squares are analytical predictions. Parameter values as in 
table [1} sdNVP stands for single dose nevirapine, ZDV stands for third trimester 
zidovudine monotherapy, PP stands for the addition of two drugs during and after 
labor (postpartum). 



in the studies). We found that sdNVP leads to detectable resistance mutations in 39% of 
952 patients, ZDV/sdNVP leads to detectable resistance mutations in 22% of 888 patients, 
adding two drugs during and after labor (sdNVP/PP) lead to detectable resistance mutations 
in 7.8% of 372 patients and ZDV/sdNVP/PP lead to detectable resistance mutations in none 
of 292 patients. 

We now used these data, in combination with our previous parameter estimates, to estimate 
the fitness of a NVP resistant mutant during NVP therapy {Fuinvp) and the reduction of 
the population size due to ZDV treatment (Nzdv)- We find that Fmjqyp 1.54 and that 
ZDV reduces the effective population size approximately two-fold (table [l] and figure |3]). 
The results show that a reduction in population size by ZDV monotherapy does reduce the 
probability that NVP resistance mutations become established, but adding two drugs to 
sdNVP helps much more. We also estimate of the fitness of the mutant during therapy 
with nevirapine and two additional drugs and find a slightly higher value than our previous 
estimate (1.025 vs 1.017), because the estimated probability that resistance mutations from 
standing genetic variation become established is slightly higher (0.078 vs 0.058 with standard 
therapy), though these differences are not statistically significant. 
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Figure 4. The probability of the estabhshment of drug resistance mutations from 
standing genetic variation depending on the effective population size and the fitness 
of the resistant mutant during therapy. Grey scales indicate the probability of the 
evolution of drug resistance due to standing genetic variation. Dots indicate esti- 
mated parameter combinations for treatment with just sdNVP, with ZDV monother- 
apy followed by sdNVP (ZDV/sdNVP), by sdNVP followed by two additional drugs 
postpartum (sdNVP/PP) and with ZDV monotherapy followed by sdNVP and two 
additional drugs postpartum ZDV/sdNVP/PP. 



6. Interruption of therapy 

During a treatment interruption, drugs are first removed from the body, which can take 
from a couple of hours to a several days or even weeks (Van Heeswijk 2000, 2001, Pirillo 2010). 
With some delay, depending on the half-life of the drugs, the viral population begins to grow, 
which is observed as an increase of viral load. Published data show that after treatment is 
stopped, viral load quickly increases in almost all patients (e.g., Harrigan 1999). Davey et 
al (1999) show that average viral load plateaus four weeks after treatment is interrupted. 
Garcia et al (2001) and Trkola et al (2005) both report that a plateau is reached between 
four and eight weeks after treatment interruptions. An interruption is ended when treatment 
is started again and viral load goes down, hopefully to undetectable levels. Figure 1 shows a 
cartoon of the pharmacodynamics and population dynamics of a treatment interruption. 

Restarting therapy. If the length of a treatment interruption is so long that the pop- 
ulation size is back to pretreatment level and mutation-selection-drift equilibrium is again 
reached, the probability that resistance mutations become established when therapy is started 
again will equal the probability that resistance mutations become established the first time a 
patient starts treatment, Pggv from equation[2j But if a treatment interruption is shorter than 
that, it is hard to calculate the exact probability that resistance will evolve upon re-initiation 
of therapy because neither population-dynamic, nor population-genetic equilibrium will have 
been reached. The absence of the population-genetic equilibrium is most problematic if re- 
sistance mutations are not very costly to the virus. However, for a costly mutation it takes 
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Figure 5. Drug level (dashed line) and viral population size (solid line) during and 
after a treatment interruption. Red bars indicate when drugs are taken. 

only on the order of 1/Crei generations to reach mutation-selection-drift equihbrium. The 
absence of population-dynamic equilibrium is less problematic, because it is relatively easy 
to predict the population size of the virus or to measure viral load. In the simulations, we 
allow the population to grow exponentially until it reaches the baseline level. The resulting 
population size can be plugged into equation |2] to get an estimate of the probability that 
resistance mutations become established due to a treatment interruption. 

Comparison with data for treatment interruptions. Using the parameter values 
from the last two sections, we can predict the risk that resistance mutations become estab- 
lished due to a treatment interruption of a certain length. We use the estimated fitness of 
the mutant virus during NVP therapy, and assume that the fitness of the mutant in absence 
of drugs is the same. With that value, we can calculate the fitness of the wildtype in the 
absence of drugs, because of the assumption that the cost of the resistance mutation is 5%. 
The wildtype fitness will determine how fast the virus grows in the simulations after treat- 
ment is interrupted, and therefore how long it takes before the population size is back at the 
pretreatment level. Specifically, we use Fyjt = 1.62. In the simulations, the population size 
plateaus after just 14 days, but Pggv reaches its expected value only after 60 days (figure |6]). 

We collected information from structured treatment interruption trials to test the predic- 
tions. The probability that resistance mutations become established due to a single treatment 
interruption was estimated for seven clinical trials with different lengths of treatment inter- 
ruptions (Ananworanich, 2003, Danel, 2009, Hoen, 2005, Palmisano, 2007, Reynolds, 2009 
and 2010, Yerly 2003). An overview of the trials can be found in table (supplementary ma- 
terial). We first calculated the risk under the assumption that all observed resistance was 
due to treatment interruptions and then subtracted the estimated probability that resistance 
mutations become established during therapy. The corrected values are shown in figure |6} 
The data show that longer treatment interruptions indeed lead to a higher risk of resistance. 
The risk plateaus around 37 days, which is consistent with the time it takes for viral load 
to reach its equilibrium level (although the simulations suggest that the risk should plateau 
later than the population size). The highest risk was found to be approximately 6% per 
interruption, just like the risk of starting therapy for the first time. 

7. Discussion 

One aim of our study was to understand and quantify the importance of standing genetic 
variation for the evolution of drug resistance in HIV. We find that the probability that at 
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Figure 6. Estimated probability that resistance mutations become established due 
to a single treatment interruption. Bars are data from seven clinical trials, +/— 
estimated standard error (see table [5| supplementary material), the red circles are 
estimated from 1000 simulations, +/— estimated standard error the grey squares are 
predictions using the average population size from the simulations and equation [2j 
Parameters as in table [Tj 



least one resistance mutation becomes established due to standing genetic variation {Pggv) 
depends on the kind of treatment chosen. Most clearly, it is much higher when treatment 
is with sdNVP (which is monotherapy) than if treatment is with triple-drug combination 
therapy. For standard combination therapy (ART), we use two different data sources to 
estimate the probability that resistance mutations from standing genetic variation become 
established. In the first part of this paper we used data on the number of patients in which 
resistance was detected in the first year of treatment versus later years. In the third part of 
this paper we used data from clinical trials on treatment interruptions. In both cases, we 
found that the probability that resistance mutations from standing genetic variation became 
established was approximately 6%. 

The importance of new mutations as compared to pre-existing mutations could be esti- 
mated from the Margot et al 2006 study. We estimated that the probability that a resistance 
mutation becomes established during therapy (Pnew) is 3.7% per year, which means that pre- 
existing mutations and new mutations are equally important after about one-and-a-half year 
of treatment. Two of the interruption studies also provided estimates for Pnew, which were 
slightly higher (4.3% and 4.8% per year) than the estimate from the Margot et al 2006 study 
(see table [5]). It is likely that some of the patients in these studies were not perfectly adherent 
to treatment, so that our estimate of Pnew is inflated by patients who interrupted treatment. 
This does not affect our estimates of Pggv However, it means that the relative importance 
of pre-existing mutations is highest in completely adherent patients (because new mutations 
are relatively unimportant for them) and lower in non-adherent patients (see Paredes 2010, 
but see Li 2011). 

A stochastic model was used to understand the effect of standing genetic variation on 
the evolution of drug resistance during HIV treatment. Four parameters are crucial to un- 
derstand the role of standing genetic variation. Three of them determine the amount of 
genetic variation that is available (effective population size, mutation rate and cost of the 
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resistance mutations) and one determines how likely it is that the available mutations become 
established (the absolute fitness of the resistant virus during treatment). 

The cost and the mutation rate are parameters that are specific for each specific mutation. 
Together, they determine the expected frequency of the mutant in an untreated patient. 
For example, in untreated patients the frequency of K103N was found to be lower than the 
frequency of Y181C (Paredes et al 2010), suggesting that /i/CreZ is lower for K103N, possibly 
because it reduces the fitness of the virus more. The costs for some of the most important 
mutations (M184V, K103N) have been estimated and are between 1 and 10% (Martinez- 
Picado 2008, Paredes 2009, Wang 2011). Throughout this paper we used a value of 5%. 

The effective population size in an untreated patient (N^) determines how much variation 
there is in the frequency of resistant mutants between patients. If > the frequency in 
each patient will be very close to the expectation, ix/Creh but if iV„ < l//i, there will be a lot 
of variation between patients, and in many patients no resistance mutations may be available 
at all. Data suggest that in HIV the latter is the case (e.g., Paredes et al 2010), which means 
that, contrary to common belief, not every single point mutation is created every generation 
in an HIV patient. A^^ also determines the number of resistant viral particles in a patient with 
a given frequency of the mutant. With higher A^„, there will be a higher number of resistant 
particles, and this makes it more likely that resistance mutations become established when 
treatment is started (Li et al 2011). 

We find that data are compatible with an 18-fold reduction of N due to ART and a two-fold 
reduction of N due to ZDV monotherapy. The estimated reduction depends on the assumed 
cost of mutations; if we assume that mutations are twice as costly, we would find a reduction 
that is twice as severe. Still, the reductions we find are not nearly as severe as one may 
have expected based on viral load reductions. During ART, VL may be reduced 1000-fold or 
more (in the Margot (2006) from which we used the data study, patients had a viral load of, 
on average, 8 10^ before treatment, whereas after 48 weeks of treatment, about 80% of the 
patients had a viral load of less than 50, Gallant 2004). This discrepancy may be due to two 
effects: firstly, our estimate is an average for many patients and this average may be driven 
up by patients in which the drugs do not work well, or who are not adherent to therapy so 
that their VL does not go down as much as expected. Secondly, the relationship between 
effective population size and viral load may not be linear, so that a thousand-fold reduction 
in VL may translate in only a twenty-fold reduction in effective population size. 

The fourth important parameter is the fitness of the mutant virus during treatment (Fm), 
which determines the establishment probability (Pest)- P^ will depend on both the drugs 
that are used and on the specific mutation. For example, the resistance mutation K103N is 
more likely to become established during sdNVP than during triple-drug therapy, because 
additional drugs reduce Fm (FmA^T < Fmjsivp)- And during triple-drug therapy, K103N 
is more likely to become established than Y181C (even though Y181C is present at higher 
frequencies before treatment), likely because Fttiart is higher for K103N than for Y181C. 

StEirting of standard therapy. We assumed that the rate of evolution due to new 
mutations is constant and that the establishment of a resistance mutation from standing 
genetic variation leads to viral failure and is detected within one year of starting therapy. 
Maybe the most convincing evidence for these assumptions comes from the Li et al (2011) 
study, where their figure 2 shows that (1) patients without detected pre-existing DRMs show 
a constant rate of evolution of resistance and (2) patients with detected pre-existing DRMs 
show an increased rate compared to the patients without pre-existing DRMs, but only in the 
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first year of treatment. We used tliese assumptions to estimate tfie probability tliat resistance 
mutations from standing genetic variation become establislied. However, tfie estimated role 
of standing genetic variation may be a slight underestimate, because establishment of new 
mutations should need some time so that Pnew would normally be somewhat lower in the 
first year of treatment. The observation that the effect of standing genetic variation only 
lasts a year, means that fixation of a resistance mutation must take less than a year. This 
limits possible values for Nart and Fttlart to such values for which the fixation time is less 
than 200 generations. For the current study, this means that, had we started with a larger 
population size estimate, then we would have had to choose a higher value of Fttiart in 
order to guarantee that pre-existing resistance mutations are detected (or lost) within a year 
of therapy, which, in turn, would have led to a higher value of Crei to keep Pgg^ ~ 6%. 

If resistance indeed evolves due to standing genetic variation in 6% of patients on standard 
ART, then there is clearly room for improvement. Note that those 6% of patients have 
already lost their first treatment option shortly after having started treatment. They have 
to switch to second-line treatment which is more expensive, usually more complicated (more 
pills per day) and likely has more side effects. It is therefore worth exploring ways to avoid 
the establishment of resistance mutations from standing genetic variation. Figure |4] suggests 
two options to reduce Pggv^ by reducing the population size or by reducing the fitness of 
the resistant mutants. The first may be achieved by ZDV monotherapy, as shown in the 
section on PMTCT, whereas the second may be achieved by adding additional drugs to the 
treatment. Obviously, triple-drug combination treatment is already standard for most HIV 
patients, but it may be worth considering specifically which treatment options would be 
best to prevent the evolution of resistance from standing genetic variation. This may mean, 
for example, to add a fourth drug to the therapy in the first couple of weeks of treatment. 
Finally, when it becomes possible to detect low-frequency DRMs routinely in patients before 
starting treatment, it may be possible to wait for a moment when their frequency is low to 
start therapy. 

Resistance due to sdNVP. Studying treatment with a single dose of nevirapine gives 
us a unique opportunity to study the effect of standing genetic variation, because treatment 
is so short (only a few HIV generations) that we can assume that most or all resistance 
mutations that are detected are from standing genetic variation. Data show that the risk 
that resistance mutations become estabhshed due to such treatment is very high (39%). We 
find that this high probability can be explained entirely by selection on pre-existing drug 
resistance mutations, because the fitness of NVP resistant virus is probably very high during 
NVP monotherapy. We estimate that its fitness is approximately 1.5. Reducing this fitness, 
by adding additional drugs or reducing the availability of the mutant by reducing population 
size will both reduce the probability that resistance is established. A study from Zambia (Chi 
et al 2007) showed that the additional drugs even help to reduce the establishment of NVP 
resistance mutations if the drugs are given as a single dose (in stead of treatment for a couple 
of days or weeks). We did not include this study in the overview, because there was only one 
study that looked at this treatment option. However, the results are interesting: they find 
that even a single dose of TDF/FTC reduces the probability that resistance mutations are 
found 6 weeks after a sdNVP from 25 to 12 %. 

The results on ZDV/sdNVP/PP treatment (i.e., treatment with ZDV during pregnancy 
and NVP plus two other drugs during labor) are surprising in that NVP resistance mutations 
were not detected in any of the women who received this treatment, even though the model 
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would predict that mutations would be detected in 4% of the women. Most of the data on 
this treatment option are from the Lallemant 2010 paper (222 women). In this study, the 
authors do find some mutations that confer resistance to the NRTI's in the study (in 2.3% 
of the women). The same study also looked at women who were treated with ZDV/sdNVP 
and also in these women the percentage with resistance mutations was very low (6.4%) and 
much lower than the mean value for women who receive this treatment (22%). The reason 
for the surprisingly low values of drug resistance in this study could be that the women in the 
study had very low viral loads (median 2800), which is much lower than in most untreated 
patients. This probably also means that they have a low effective population size. It therefore 
seems unlikely that the extremely good results from the Lallemant study can be replicated 
in other populations. However their results still show that using additional drugs to reduce 
the population size and to reduce the fitness of the mutant may be a good strategy to reduce 
the probability that resistance becomes established. 

Treatment interruptions. Considering treatment interruptions, our model provides 
several testable predictions. 1) resistance mutations are more likely to become established 
after long treatment interruptions when viral loads are higher, 2) the risk that resistance 
mutations become established due to a treatment interruption can not be larger than the 
risk at the start of treatment, 3) treatment interruptions increase the risk of establishment 
of resistance mutations even for drugs with short half-lifes. 

Data from seven clinical trials show that indeed, longer interruptions increase the prob- 
ability that resistance mutations become established (figure [6]). Moreover, the estimated 
probability appears to plateau after 37 days, which is similar to the time it takes for viral 
load to reach its pretreatment level. This suggests that the risk of establishment of resis- 
tance mutations is directly linked to the viral load when treatment is started again. However, 
in our simulations, population size plateaued much earlier than the risk of resistance. Un- 
fortunately, the resolution of both viral load measurements and of the data on treatment 
interruptions are not good enough to determine whether in reality viral load indeed plateaus 
before the risk or not. If both would plateau at the same time, it would suggest that the 
resistance mutations are more costly than we had assumed. The second prediction was also 
found to hold: the estimated risk that resistance mutations from standing genetic variation 
become established at the start of treatment was found to be similar to the risk due to a 
long treatment interruption (6% in both cases). The third prediction also holds, as data 
show that interruptions increase the risk of establishment of resistance mutations even for 
PI based treatment (Yerly 2003 and Hoen 2005), where the "tail of monotherapy" cannot 
explain the observations. 

A potential problem with the data is that not only the length of the interruptions, but 
also the length of treatment periods between the interruptions differed between the seven 
studies. The trials that were compared also differed in the drugs that were used (see table 
[5] in supplementary material), which makes direct comparison difficult. Despite all these 
limitations, it becomes clear that longer interruptions carry a higher risk of evolution of 
resistance than shorter interruptions. 

If interruptions lead to the establishment of resistance mutations only due to the "tail of 
monotherapy", as is usually assumed in the HIV literature (Taylor 2007, Fox 2008, Gardner 
2010), we would predict that: 4) treatment interruptions increase the risk that resistance 
mutations become established only for drugs with long half-lifes, 5) the risk that resistance 
mutations become established due to a treatment interruption is unrelated to the risk at 
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the start of treatment and 6) the largest risk would be due to an interruption with a length 
that is exactly the time it takes for the last drug to loose its effect on the wildtype virus. 
All of these predictions do not hold. This is not to say that the "tail of monotherapy" is 
not important at all. But it does show that on its own, the "tail of monotherapy" cannot 
explain the risk that resistance mutations become established due to treatment interruptions. 
When one considers possible intervention strategies, this may be good news. If treatment 
interruptions are risky because of restarting rather than stopping therapy, this would give 
doctors a possibihty to reduce the risk that resistance mutations become established even 
after a patient has already stopped taking his or her drugs. 

The data on single dose nevirapine clearly show that monotherapy makes it likely that 
resistance mutations become established, so intuitively, one would expect that monotherapy 
should also be important also in patients who interrupt treatment, which is why the "tail of 
monotherapy" hypothesis is so popular. However, in patients who are treated with sdNVP, 
the resistance mutations are most likely already present, because it is the first treatment 
they receive and the viral population will be large. On the other hand, if a fully suppressed 
patient interrupts therapy, a few days of monotherapy may not have a large effect, because 
the population size of the virus will be very small. Monotherapy in those patients would 
still lead to a high establishment probability of resistance mutations, but if such mutations 
are not present in the population, they cannot become established. The situation may be 
different in patients who take treatment very irregularly, so that they are more likely to have 
higher viral loads when treatment is stopped. This may explain why resistance to NNRTIs, 
which are long half-hfe drugs, is likely to evolve in non-adherent patients (Tam et al 2008). 

General remarks. We have used a population-dynamic and population-genetic model 
to study several patterns of drug resistance in HIV. The model explains why resistance 
mutations are likely to be established in the first year of standard treatment, in women who 
are treated with a single dose of nevirapine and in patients who interrupt treatment. In all 
three cases, standing genetic variation can explain the observations. 

Our results illustrate that for adaptive evolution to happen, selection and the creation of 
new variation need not happen at the same time, if selection can work on standing genetic 
variation. In the case of antiretroviral treatment, this means that insufficient drug levels 
(which allow for replication and selection at the same time) are not a necessary condition for 
the evolution of drug resistance, although they would speed up such evolution. This result 
about time-heterogeneous drug levels is similar to the result on heterogeneity in space by Ke- 
pler and Perelson (1998), who showed that genetic variation may be created in compartments 
where drugs cannot penetrate whereas selection happens in other compartments. 

Note that our model provides a simple explanation for why resistance is less likely to 
evolve when patients are treated with multiple drugs at the same time in stead of just one 
drug. Additional drugs reduce the fitness of a mutant that is resistant against one drug, and 
therefore the estabhshment probability of such a resistant mutant. In addition, additional 
drugs reduce the population size of the virus and thereby the creation of new resistance 
mutations. This means that there will be fewer resistance mutations with lower establishment 
probabilities, together leading to a strong reduction in the probability that resistance evolves. 
This model therefore explains why the evolution of drug resistance is slower with triple-drug 
therapy, but not completely halted, as escape due to one mutation is still possible. In newer 
therapies with boosted Pis, drug resistance has become very rare (Cozzi-Lepri 2010), which 
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may be because boosted Pis are so strong that no single mutation can lift the virus' fitness 
above 1. 

The model in this study, which includes the effect of population size on the probability 
that resistance mutations become estabhshed may be relevant to other diseases than HIV. 
For example, the evolution of resistance is a problem in chronic myeloid leukemia (CML) 
which is a cancer of white blood cells. A recent study suggested that the probability that 
drug resistance evolves in CML goes down with time because the population size of the cancer 
goes down with time (Tomasetti, 2011, BCJ). 

Resistance is also a problem in tuberculosis (TB) , and in TB it is also known that treatment 
interruptions increase the risk of evolution of resistance (Weis 1994). It should be investigated 
whether this effect may also be due to an increased population size during the interruptions. 
In general, stopping treatment may be risky when treatment has to be started again, which is 
always the case for HIV and often for TB. Each time therapy is started, resistance mutations 
from standing genetic variation may become established, and even if this risk is only a few 
percent it adds up quickly when patients interrupt treatment regularly. 
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10. Supplementary material 

10.1. Model. The model describes population dynamics and population genetics of a pan- 
mictic virus population in a single patient. Only virus in infected cells is considered and the 
dynamics of free virus and uninfected cells are ignored. The total number of cells that can be 
infected is limited to the carrying capacity. Because the life cycle of the virus is simplified 
to just one step in which infected cells "give birth" to infected cells, all drugs have the same 
effect: they reduce the number of cells that can be infected by the virus in an infected cell. 
We will refer to the infected cells by the virus they are infected with. 

When patients fail therapy, this is often due to virus that is resistant against just one 
drug even if the patient was treated with three drugs (as described in the Introduction). We 
therefore assume in our model that there are just two types of viruses: wildtype virus that 
is susceptible to all drugs and resistant virus that is resistant against one of the drugs in the 
regimen of the patient. We assume that prior to establishment of single drug-resistant virus 
(hereafter: resistant virus), there is no virus present that is resistant against more than one 
drug. We only focus on the probability that single drug-resistant virus becomes established. 

The resistant type has a growth rate larger than 1 even in the presence of drugs, but in 
the absence of drugs it is less fit than the wildtype. Initially the population consists of only 
wildtype virus, resistant virus can be created by mutation. Absolute fitnesses are determined 
for wildtype and resistant virus, in the absence of drugs (denoted by u for untreated) and 
during treatment (either ART or NVP). These fitnesses are absolute fitnesses, which are 
equivalent to the reproductive ratio, or the expected number of cells a viral particle would 
infect in a completely susceptible population of host cells. Without drugs, wildtype virus is 
fitter than resistant virus, so that the population will always be dominated by wildtype virus. 
During treatment, resistant virus is fitter than wildtype, so that the population will ultimately 
become dominated by resistant virus. However, if evolution is mutation-limited, it can take 
a long time before the equilibrium with resistant virus is reached. One of the successes of 
combination therapy is to make this waiting time much longer than with monotherapy. 

The relative cost or selective disadvantage is defined as Crei — {Fwty, — Fmu)/{Fwty), or, 
in other words, how much less fit resistant virus is in the absence of drugs, relative to the 
wildtype. As will be shown later, the fitness of the wildtype with drugs is not important in 
the current model (although we do have to determine its value for the simulations). 

If the average fitness of the viral population is larger than 1, the viral population grows 
exponentially until it reaches the carrying capacity (K), at which point the population stops 
growing and stays stable. If the average fitness of the population is below 1, the population 
shrinks until it reaches //(I — F^*), where / is a fixed number of particles which is added to 
the population each generation. This refiects the latently infected cells in an HIV infected 
patient. These latently infected cells have very long half-hfes and function as a reservoir for 
the HIV population. Because of these cells, the virus population does not die out, even if 
therapy is very successful and there is (almost) no replication. 

Simulation details. Time is counted in generations. Each generation is split in 10 time- 
steps. At each time step, each viral particle has a probability of 1/10 to be chosen to reproduce 
and die. The expected life-time of a viral particle is therefore 1 generation and this is the same 
for wildtype and resistant virus. All particles that are chosen to reproduce at a given time-step 
die and are replaced by their offspring (i.e. newly infected cells). The number of offspring is 
equal to [number of chosen wildtype particles] *[F^i]-|- [number of chosen resistant particles]* 
[Fres]- If this number is smaller than 20, stochastic effects should be taken into account, so 
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instead of using the number directly, we take a random number from the Poisson distribution 
with the expected number as the mean. If adding of the new offspring to the population would 
lead the population size to be higher than we add fewer individuals so that the population 
size will be exactly K. The new offspring can be either wildtype or resistant. It is assigned 
a type at random, with probability 

[number of chosen wildtype particles] * [Fyjt\ 
[number of chosen wildtype particles] * [F^j^ + [number of chosen res particles] * [F^es] 

to be wildtype, and resistant otherwise. This is a standard procedure (also used in Hermisson 
and Pennings 2005) The choice of 10 time-steps per generation is a compromise between the 
speed of simulating discrete generations and the reality of continuous replication. 

Mutation occurs only from wildtype to resistant. Back-mutation to wildtype is ignored. 
The number of mutations in a generation depends on the (per particle and per generation) 
mutation rate, and the number of newly created infected cells, which in turn depends on 
fitness and population size. 

Analytical calculation of the fixation probability. To calculate the probability that 
resistance mutations become established when treatment is started (either for the first time 
or after an interruption), there are two possibilities. When the relative cost of resistance in 
the absence of drugs, the fitness of the resistant virus with drugs and the viral population size 
are all known, we can use the theory from Hermisson and Pennings (2005) to calculate the 
probability that resistance mutations from the standing genetic variation become established. 
Alternatively, if the number of copies of the resistant virus and the fitness of resistant virus are 
known, it is possible to calculate the probability that at least one of these copies will survive, 
in which case the population becomes resistant. For the latter, we need the probability 
that a single resistant mutant spreads in the population and ultimately goes to fixation, as 
opposed to being lost due to genetic drift. In the current model, individuals (viral particles 
in infected cells) will "produce" a Poisson distributed number of newly infected cells (we call 
these offspring). The mean number of offspring. A, determines the probability that a lineage 
dies out or survives. Usually, it is assumed that the mean number of offspring of a genotype 
depends on its fitness relative to the mean fitness of the population. However, when a patient 
takes drugs, the population size of the virus will be much lower than the carrying capacity, 
so that there is effectively no competition between resistant and wildtype virus. The mean 
number of offspring. A, therefore does not depend on the relative fitness, but simply equals 
the absolute fitness of the resistant virus. In 1927, Haldane showed that the relationship 
between the mean number of offspring (A) and the fixation probability is 



Traditionally, the fixation probability is calculated using the selection coefficient, — 
Fres^Fwt ^ instead of the expected number of offspring, A. In a population with stable pop- 
ulation size and competition between genotypes this is fine because A = 1 -|- s^; but in the 
current model, this does not hold and Sf, is irrelevant. 

If A — 1 is small, a simple approximation for the fixation probability is: 



(7) 



Pf,, ^ 2(A - 1) 
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When A is larger, Haldanes equation needs to be solved numerically using more terms of 
the Taylor expansion. 

10.2. Data from clinical trials. Starting standard therapy. The probability that resis- 
tance mutations from standing genetic variation become established at the start of therapy 
was estimated from a data set that was published in Margot (2006), see table [s] for the 
raw data. The study reported the number of patients where resistance was detected in the 
first, second and third year of treatment, for two treatment groups and for three categories 
of resistance mutations. Because in each case only the first resistance mutation in a given 
patient was recorded, the data for the three different resistance classes could not be analyzed 
separately. A saturated model was fitted using R (R Development Core Team, 2005) after 
which non-significant interactions were removed. It was found that the probability that re- 
sistance mutations became established did not differ significantly between the two treatment 
groups (p=0.26) and also not between year 2 and year 3 (p=0.78). However, there was a 
clear difference between year 1 and the other two years {p < 10^^). The average risk per 
year in year 2 and 3 was 3.7%, whereas the average risk in year 1 was 9.5%. The difference 
between year 1 and the other 2 years was 5.8%, which could be due to the standing genetic 
variation before treatment started. 

Single dose nevirapine. We searched the web of science (ISI) database for published 
studies on single dose nevirapine and resistance. We limited ourselves to studies that reported 
the fraction of women that had detectable resistance at 4 to 6 weeks postpartum using 
standard sequencing. An overview of the studies can be found in table |4} 

Interruption studies. Seven studies were identified that reported on clinical trials in 
which patients interrupted treatment multiple times for a fixed length of time (i.e., all pa- 
tients in the trial were on the same schedule) and which tested genotypic resistance on plasma 
samples of all patients in which treatment failed (and ideally of all patients) before and after 
the trial. Characteristics of the seven studies can be found in table |5] (supplementary mate- 
rial). The shortest interruption length was two days and the longest 60 days. If possible, we 
removed patients which had evidence of resistance before the trial started. In all but one of 
the studies, patients were on traditional triple-drug regimens (NNRTI or PI based with two 
NRTIs). Only in the Staccato study some patients were on a boosted PI regimen (Anan- 
woranich et al, 2003). These patients were taken out of the analysis because resistance seems 
to evolve much slower in regimens with boosted Pi's compared to NNRTI and unboosted PI 
regimens (Lima, 2008). In two trials, the interruptions had varying lengths (Hoen 2005 and 
Palmisano 2007), in which case we used the mean length of the interruptions for the plot 
(these points are in grey in the plot, see figure [6]). 

For each study, the fraction of patients {F) which did not acquire any resistance mutations 
during the trial was calculated. The number of treatment interruptions (T) which were 
relevant for the resistance data was calculated. For example, if genotypic resistance tests were 
done on samples that were obtained during treatment interruption 4, it was assumed that any 
resistance had occurred due to treatment interruption 1, 2 or 3 so that T = 3. Furthermore 
it was assumed that each treatment interruption (TI ) contributed equally to the probability 
that resistance mutations became established. The probability (P) that resistance mutations 
became established due to a single interruption was calculated as follows: 



(8) 



F ={1-Pf ^ P = l- F^/^ 
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Two of the studies also allowed for an estimation of the risk that resistance mutations 
became established during continuous treatment (Reynolds 2009 and 2010, and Danel 2009). 
We find a rate of evolution of resistance of 4.3% and 4.8% per year (see table [s]). Using 
the mean of these two rates, we can estimate the probability that resistance mutations be- 
came established during the time on treatment in each of the trials and we corrected the 
estimated risk of an interruption by subtracting the probability that resistance mutations 
became established during treatment. 

10.3. Supplementary tables. 
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Table 3. Number of patients with at least one resistance mutation detected by the 
end of the first, second and third year of NNRTI-based antiretroviral therapy. Data 
from Margot et al (2006). 



Year 


Group 


Resistant 


Not Resistant 


1 


TDF 


35 


264 


2 


TDF 


8 


256 


3 


TDF 


9 


247 


1 


d4T 


22 


279 


2 


d4T 


10 


269 


3 


d4T 


11 


258 
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Table 4. Overview of clinical trials with single dose nevirapine which reported the 
number of patients with nevirapine resistance detected 6 to 8 weeks after treatment. 



Reference and name trial 


Number of 
patients 


Treatment 


Treatment code 


Est. prob of es- 
tablishment of 
resistance 


Remarks 


Esnlcman 2005, Malawi 


65 


SQlN V r 


SQlN V r 


0.69 




Eslilenian 2005, Uganda 




SQIN V -T 


SQ^N V i 






i^arr 2010, Malawi 


65 


sdNVP 


sdNVP 


0.34 




rluaelson 2U1U , Uganda 


30 


saiN VP 


saiN V P 


0.43 




Jackson 2000, Uganda 


15 


sdNVP 


sdNVP 


0.20 




Kassayc 2U07, Zimbabwe 


32 


sdlN VP 


saN VP 


0.35 




Lcc zOUo, Zimbabwe 


32 


sdlN VP 


sdJN VP 


0.34 




Loubscr 20()o, south Arrica 


44 


sdNVP 


sdNVP 


0.54 


Results only for 
K103N mutation 


Ly 2UU7, Oamboaia 


35 


sdJN VP 


sdN VP 


0.23 




Martinson 2009, South 


108 


sdNVP 


sdNVP 


0.38 




Airica, rliVJNhji 012 












Martinson 2009, South 


193 


sdNVP 


sdNVP 


0.46 




A r ■ TTTT T'MTTl'Ti r\-l '"1 

Airica, HlVMbi 012 












Mclntyrc 2009, South 


74 


sdNVP 


sdNVP 


0.59 




Africa 












Rajesh 2010, India 


12 


sdNVP 


sdNVP 


0.33 




Toni 2005, Ivorycoast 


29 


sdNVP 


sdNVP 


0.21 




Farr 2010, Malawi 


120 


sdNVP + 7 days 
3TC/ZDV 


sdNVP/PP 


0.017 




Mclntyrc 2009, South 


164 


sdNVP + 4 days 


sdNVP/PP 


0.097 




Africa 




3TC/ZDV 








Mclntyre 2009, South 


168 


sdNVP + 7 days 


sdNVP/PP 


0.073 




Africa 




3TC/ZDV 








Chaix 2007, Ivorycoast, 


63 


ZDV from 36 weeks + 


ZDV/sdNVP 


0.33 




ANRS/Ditrame Plus 




sdNVP 








Chalermchokcharoenkit 


190 


ZDV 3rd trimester + 


ZDV/sdNVP 


0.18 




2009, Tiiailand 




sdNVP 








Chi 2009 and 2007, Zambia 


166 


ZDV 3rd trimester + 
sdNVP 


ZDV/sdNVP 


0.25 




Lallemant 2009, Thailand, 


222 


ZDV 3rd trimester + 


ZDV/sdNVP 


0.064 




PHPT2 




sdNVP 








Ly 2007, Cambodia 


16 


ZDV from 28 weeks + 
sdNVP 


ZDV/sdNVP 


0.19 




Shapiro 2006, Botswana 


155 


ZDV from 34 weeks + 
sdNVP 


ZDV/sdNVP 


0.45 




Van Zijl 2008, South Africa 


76 


ZDV from 34 weeks + 
sdNVP 


ZDV/sdNVP 


0.17 




Arrive 2010, Cambo- 


33 


ZDV from enrollment 


ZDV/sdNVP/PP 


0.0 




dia /Ivorycoast /South 
Africa, TEmAA ANRS 




+ sdNVP + 1 week 
TDF/FTC 








12109 












Dabis 2009, Cambo- 


37 


ZDV from enrollment 


ZDV/sdNVP/PP 


0.0 




dia/Ivorycoast /South 




+ sdNVP + 1 week 








Africa, TEmAA ANRS 




TDF/FTC 








12109 












Lallemant 2009, Thailand, 


222 


ZDV 3rd trim + 


ZDV/sdNVP/PP 


0.0 




PHPT4 




sdNVP + 1 month 
ZDV/DDI 
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Table 5. Overview of clinical trials with structured treatment interruptions which 
reported the number of patients with at least one drug resistance mutation detected. 



Reference and name 


% on 


Number 


Patients 


Patients 


Fraction 


TPs 


Length 


Treatment 


Est. prob 


Corrected 


trial 


PI 


of pa- 


excluded 


with 


not re- 


rele- 


of TI 


period 


of evo- 


prob of 






tients 




genotypic 
resistance 


sistant 
at end of 
trial 


vant for 
calcula- 
tion 


(days) 


(days) 


lution of 
resistance 
per TI 


evolu- 
tion of 
resistance 


Interruption arms 




Reynolds 2009 


2% 


57 


1 (no geno- 


4 


52/56 


72 


2 


5 








and 2010, Uganda 






type) 
















FOTO arm 






















Ananworaiiicli 


0% 


36 


22 (on 


3 


9/12 


17 


7 


7 


0.02 


0.02 


2003, Staecato 






boosted 
















WOWO arm 






PI) 2 (had 
resistance 
before 
trial) 
















Reynolds 2009 


6% 


32 


3 (left the 


9 


20/29 


18 


7 


7 


0.02 


0.02 


and 2010, Uganda 






trial) 
















WOWO arm 






















Yerly 2003 , SSITT 


100% 


87 


4 (lost to 


11 


69/80 


4 


14 


56 


0.04 


0.03 


trial 






follow-up) 
3 (already 
resistance 
before 
trial) 
















Hoen 2005, ANRS 


100% 


26 


4 (re- 


2 


19/21 


2 


14, 28 


84 


0.05 


0.04 


100 Primstop trial 






sistance 
before 

trial) 1 
(resistance 
in first TI) 








(mean 
21) 








Palmisano 2007, 


25% 


136 


16 (es- 


22 


98/120 


3 


28, 


91 


0.07 


0.06 


IS S -PART 






timated 
number 
that had 

resistance 
before) 








28, 56 

(mean 

37) 








Danel 2009, 


10% 


325 


10 (no 


76 


239/315 


4 


60 


91 


0.07 


0.06 


Ivory Coast NOT 






genotypes) 
















00158405 






















Continuous arms 




Reynolds 2009 and 


2% 


51 





3 


48/51 






504 




0.043/year 


2010, Uganda con- 






















tinuous arm 






















Danel 2009, 


15% 


110 


3 (no geno- 


10 


97/107 






728 




0.048/year 


Ivory Coast NCT 






types) 
















00158405 























